Automatic partitioning of signal processing programs for symmetric multiprocessors
نویسندگان
چکیده
Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works, must be obtained from the IEEE. One such application domain is on-board signal processing. The Strategic Defense Initiative Organization (SDIO) Signal and Data Processing Benchmark Suite [Nic91] is a set of kernels drawn from aerospace and military systems that perform functions such as radar and sonar signal processing. These kernels are CPU intensive. Because they have real-time scheduling demands, minimizing latency is critical. Traditionally, these applications run on custom-designed distributed-memory systems that have up to hundreds of processors operating in a coarse-grained data flow fashion, as illustrated in Figure 1. The application is usually partitioned by hand into stages that are spread across the processors. Code is generated separately for each processor and communication and synchronization among these processes is explicitly specified by the programmer. Frequently, the poor quality or unavailability of compilers for the specialized processors used requires hand-coding in assembly language. An alternative to this approach is to replace a large number of potentially specialized processors in these systems with a much smaller number of SMP systems. Projected trends in advanced CMOS technology indicate the availability of small-scale SMPs on a single chip or MCM at relatively low prices in the near future. It is inevitable that aerospace and military on-board signal processing systems will need to leverage the price/performance benefits of such commercial SMP systems. There is a crucial need for compilation tools that can take existing application software, possibly in the form of compiled object code, and partition and schedule it for efficient execution on such systems. Such tools will facilitate the porting of on-board signal processing sequential programs for parallel execution on these modern high-performance SMP systems. The specific focus of this paper is to investigate the feasibility of automatically parallelizing on-board signal and data processing applications for parallel execution on commercial SMPs. We present the PEDIGREE compiler, Figure 1. High-level view of typical on-board signal processing sensor-based image processing feature-based signal processing command and control data processing Abstract Symmetric multiprocessor systems are increasingly common , not only as servers, but as a vehicle for executing a single application in parallel in order to reduce its execution latency. This paper presents …
منابع مشابه
A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملUsing Symmetric Multiprocessor Architectures for High Performance Computing Environments
Performance enhancement for high speed computing can be carried out by using many techniques and architectures at software and high hardware level. Performance enhancement using hardware techniques may include the use of multiple computing nodes or a single node consisting of multiple processors. Symmetric multiprocessor is one of the modern architectures used to perform extensive computations....
متن کاملAutomated Parallelization of Non-uniform Convolutions on Chip Multiprocessors
This paper introduces an approach for automatic parallelization of unequally-spaced convolutions on chip multiprocessors (CMPs). CMPs are very promising candidates for digital processing in signal and image systems with high throughput and low power consumption, compared to uniprocessor based architectures. As CMPs are emerging and evolving in increasing diversity and complexity, automated para...
متن کاملUsing Utilization Pro les in Allocation and Partitioning for Multiprocessor Systems
The problems of multiprocessor partitioning and program allocation are interdependent and critical to the performance of multiprocessor systems Minimizing resource partitions for parallel programs on partitionable multiprocessors facilitates greater processor utilization and throughput The pro cessing resource requirements of parallel programs vary during program execution and are allocation de...
متن کاملComparing Gang Scheduling with Dynamic Space Sharing on Symmetric Multiprocessors Using Automatic Self-Allocating Threads (ASAT)
This work considers the best way to handle a diverse mix of multi-threaded and single-threaded jobs running on a single Symmetric Parallel Processing system. The traditional approaches to this problem are free scheduling, gang scheduling, or space sharing. This paper examines a less common technique called dynamic space sharing. One approach to dynamic space sharing, Automatic Self Allocating T...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996